CHAPTER 17 More of a Good Thing: Multiple Regression 245
»
» The multiple r2 value represents the amount of variability in the dependent
variable explained by the model, so you want it to be high. As shown in
Figure 17-2, it is 0.52 in this example, indicating a moderately good fit.»
» A statistically significant F statistic indicates that the model predicts the
outcome significantly better than the null model. As shown in Figure 17-2, the
p value on the F statistic is 0.009, which is statistically significant at α = 0.05.
Figure 17-4 shows another way to judge how well the model predicts the outcome.
It’s a graph of observed and predicted values of the outcome variable, with a
superimposed identity line (Observed Predicted). Your program may offer this
observed versus predicted graph, or you can generate it from the observed and pre-
dicted values of the dependent variable. For a perfect prediction model, the points
would lie exactly on the identity line. The correlation coefficient of these points is
the multiple r value for the regression.
Watching Out for Special Situations
that Arise in Multiple Regression
Here we describe two topics that come up in multiple regression: interactions
(both synergistic and anti-synergistic), and collinearity. Both relate to how the
simultaneous behavior of two predictors can influence an outcome.
FIGURE 17-4:
Observed versus
predicted
outcomes for the
model SBP ~ Age
+ Weight, for the
data in
Table 17-2.
© John Wiley & Sons, Inc.